Integration of a memory transistor into high-k, metal gate cmos process flow

ABSTRACT

Memory cells including embedded SONOS based non-volatile memory (NVM) and MOS transistors and methods of forming the same are described. Generally, the method includes: forming a gate stack of a NVM transistor in a NVM region of a substrate including the NVM region and a plurality of MOS regions; and depositing a high-k dielectric material over the gate stack of the NVM transistor and the plurality of MOS regions to concurrently form a blocking dielectric comprising the high-k dielectric material in the gate stack of the NVM transistor and high-k gate dielectrics in the plurality of MOS regions. In one embodiment, a first metal layer is deposited over the high-k dielectric material and patterned to concurrently form a metal gate over the gate stack of the NVM transistor, and a metal gate of a field effect transistor in one of the MOS regions.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 15/080,997, filed Mar. 25, 2016, which is a continuation of U.S. patent application Ser. No. 14/516,794, filed on Oct. 17, 2014, which is a continuation of U.S. patent application Ser. No. 14/229,594, filed on Mar. 28, 2014, now U.S. Pat. No. 8,883,624, issued on Nov. 11, 2014, and claims the benefit of priority under 35 U.S.C. 119(e) to U.S. Provisional Patent Application No. 61/883,873, filed Sep. 27, 2013, all of which are incorporated by reference herein in its entirety.

TECHNICAL FIELD

The present disclosure relates generally to semiconductor devices, and more particularly to memory cells including embedded or integrally formed SONOS based non-volatile memory (NVM) transistors and metal-oxide-semiconductor (MOS) transistors including high-k dielectrics and metal gates and methods for fabricating the same.

BACKGROUND

For many applications, such as system-on-chip, it is desirable to integrate logic devices and interface circuits based upon metal-oxide-semiconductor (MOS) field-effect transistors and non-volatile memory (NVM) transistors on a single chip or substrate. This integration can seriously impact both the MOS transistor and NVM transistor fabrication processes. MOS transistors are typically fabricated using a standard or baseline complementary-metal-oxide-semiconductor (CMOS) process flows, involving the formation and patterning of conducting, semiconducting and dielectric materials. The composition of these materials, as well as the composition and concentration of processing reagents, and temperature used in such a CMOS process flow are stringently controlled for each operation to ensure the resultant MOS transistors will function properly.

Non-volatile memory (NVM) devices include non-volatile memory transistors, silicon-oxide-nitride-oxide-semiconductor (SONOS) based transistors, including charge-trapping gate stacks in which a stored or trapped charge changes a threshold voltage of the non-volatile memory transistor to store information as a logic 1 or 0. Charge-trapping gate stack formation involves the formation of a nitride or oxynitride charge-trapping layer sandwiched between two dielectric or oxide layers typically fabricated using materials and processes that differ significantly from those of the baseline CMOS process flow, and which can detrimentally impact or be impacted by the fabrication of the MOS transistors.

In particular, forming a gate oxide or dielectric of a MOS transistor can significantly degrade performance of a previously formed charge-trapping gate stack by altering a thickness or composition of the charge-trapping layer. At 28 nm and beyond, CMOS technologies will switch to using a thin High-k dielectric in place of the silicon dioxide or silicon oxynitride and metal gate instead of polysilicon. The process flow for these elements is significantly different than the current CMOS and NVM process flows. In addition, this integration can seriously impact the baseline CMOS process flow, and generally requires a substantial number of mask sets and process steps, which add to the expense of fabricating the devices and can reduce yield of working devices.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the present embodiment will be understood more fully from the detailed description that follows and from the accompanying drawings and the appended claims provided below, where:

FIG. 1 is a flowchart illustrating an embodiment of a method for fabricating a memory cell including a non-volatile memory (NVM) transistor and metal-oxide-semiconductor (MOS) transistors including a high-k dielectric and a metal gate in a gate first scheme;

FIGS. 2A-2N are block diagrams illustrating cross-sectional views of a portion of a memory cell during fabrication of the memory cell according to the method of FIG. 1;

FIG. 3 is a flowchart illustrating another embodiment of another method for fabricating a memory cell including a NVM transistor and MOS transistors including a high-k dielectric and a metal gate in a gate last scheme;

FIGS. 4A-4I are block diagrams illustrating cross-sectional views of a portion of a memory cell during fabrication of the memory cell according to the method of FIG. 3;

FIG. 5 is a flowchart illustrating an yet another embodiment of a method for fabricating a memory cell including a NVM transistor and MOS transistors including a high-k dielectric in a trapping layer;

FIGS. 6A-6F are block diagrams illustrating cross-sectional views of a portion of a memory cell during fabrication of the memory cell according to the method of FIG. 5;

FIG. 7 is a flowchart illustrating an yet another embodiment of a method for fabricating a memory cell including a NVM transistor and MOS transistors including a high-k dielectric in a tunnel dielectric; and

FIGS. 8A-8E are block diagrams illustrating cross-sectional views of a portion of a memory cell during fabrication of the memory cell according to the method of FIG. 7.

DETAILED DESCRIPTION

Embodiments of methods of integrating a non-volatile memory (NVM) transistor into a complementary metal-oxide-semiconductor (CMOS) fabrication process or process flow including metal-oxide-semiconductor-field-effect-transistors (MOSFETs) with a high dielectric constant (high-k) gate dielectric and a metal gate to produce memory cells are described herein with reference to figures. However, particular embodiments may be practiced without one or more of these specific details, or in combination with other known methods, materials, and apparatuses. In the following description, numerous specific details are set forth, such as specific materials, dimensions and processes parameters etc. to provide a thorough understanding of the present embodiment. In other instances, well-known semiconductor design and fabrication techniques have not been described in particular detail to avoid unnecessarily obscuring the present embodiment. Reference throughout this specification to “an embodiment” means that a particular feature, structure, material, or characteristic described in connection with the embodiment is included in at least one embodiment of the patent document. Thus, the appearances of the phrase “in an embodiment” in various places throughout this specification are not necessarily referring to the same embodiment of the patent document. Furthermore, the particular features, structures, materials, or characteristics may be combined in any suitable manner in one or more embodiments.

The terms “over,” “under,” “between,” and “on” as used herein refer to a relative position of one layer with respect to other layers. As such, for example, one layer deposited or disposed over or under another layer may be directly in contact with the other layer or may have one or more intervening layers. Moreover, one layer deposited or disposed between layers may be directly in contact with the layers or may have one or more intervening layers. In contrast, a first layer “on” a second layer is in contact with that second layer. Additionally, the relative position of one layer with respect to other layers is provided assuming operations deposit, modify and remove films relative to a starting substrate without consideration of the absolute orientation of the substrate.

The NVM transistor may include memory transistors or devices implemented using Silicon-Oxide-Nitride-Oxide-Silicon (SONOS) or Metal-Oxide-Nitride-Oxide-Silicon (MONOS) technology.

An embodiment of a method for integrating or embedding a NVM transistor into a high-k, metal gate CMOS process flow will now be described in detail with reference to FIG. 1 and FIGS. 2A through 2N. FIG. 1 is a flowchart illustrating an embodiment of a gate-first method or process flow. FIGS. 2A-2N are block diagrams illustrating cross-sectional views of a portion of a memory cell 200, including a NVM transistor and metal-oxide-semiconductor (MOS) transistors, during fabrication of the memory cell according to the method of FIG. 1.

Referring to FIG. 1 and FIG. 2A, the process begins with forming a number of isolation structures 202 in a wafer or substrate 204 (step 102). The isolation structures 202 isolate the memory cell being formed from memory cells formed in adjoining areas (not shown) of the substrate 204 and/or isolate the NVM transistor 206 being formed in a NVM region 208 of the substrate from one or more of the MOS transistors 210 a-210 c being formed in multiple adjoining MOS regions 212 a-212 c. The isolation structures 202 include a dielectric material, such as an oxide or nitride, and may be formed by any conventional technique, including but not limited to shallow trench isolation (STI) or local oxidation of silicon (LOCOS). The substrate 204 may be a bulk wafer composed of any single crystal or polycrystalline material suitable for semiconductor device fabrication, or may include a top epitaxial layer of a suitable material formed on a substrate. Suitable materials include, but are not limited to, silicon, germanium, silicon-germanium or a III-V compound semiconductor material.

Generally, as in the embodiment shown, a pad oxide 214 is formed over a surface 216 of the substrate 204 in both the NVM region 208 and the MOS regions 212 a-212 c. The pad oxide 214 can be silicon dioxide (SiO₂) having a thickness of from about 10 nanometers (nm) to about 20 nm and can be grown by a thermal oxidation process or in-situ steam generation (ISSG).

Referring to FIG. 1 and FIG. 2B, dopants are then implanted into substrate 204 through the pad oxide 214 to concurrently form wells in the NVM region 208 and one or more of the MOS regions 212 a-c, and to form channels 218 for MOS transistors that will be formed in the MOS regions (step 104). The dopants implanted may be of any type and concentration, and may be implanted at any energy, including energies necessary to form wells or deep wells for an NVM transistor 206 and/or MOS transistors 210 a-210 c, and to form channels for MOS transistors. In a particular embodiment illustrated in FIG. 2B, dopants of an appropriate ion species are implanted to form a deep N-well 220 in the NVM region and in a MOS region 212 b over or in which a high-voltage (HV) transistor, such as an input/output field effect transistor (I/O FET), will be formed. Although not shown, it is to be understood that wells or deep wells can also be formed for the standard or low-voltage transistor, such as a low voltage field effect transistor (LVFET), in MOS regions 212 a and 212 c. The LVFET can be a PMOS LVFET (PLVFET) or a NMOS LVFET (NLVFET) and the dopants for the well selected accordingly. It is further to be understood that the wells are formed by depositing and patterning a mask layer, such as a photoresist or PR layer above the surface 216 of the substrate 204, and implanting an appropriate ion species at an appropriate energy to an appropriate concentration.

Channels 218 for one or more of the MOS transistors 210 a-210 c, are formed in one or more of the MOS regions 212 a-c of the substrate 204. As with the well implant the channels 218 are formed by depositing and patterning a mask layer, such as a photoresist layer above the surface 216 of the substrate 204, and implanting an appropriate ion species at an appropriate energy to an appropriate concentration. For example, BF₂ can be implanted at an energy of from about 10 to about 100 kilo-electron volts (keV), and a dose of from about 1e12 cm⁻² to about 1e14 cm⁻² to form an N-type MOS (NMOS) transistor. A P-type MOS (PMOS) transistor may likewise be formed by implantation of Arsenic or Phosphorous ions at any suitable dose and energy. It is to be understood that implantation can be used to form channels 218, in all of the MOS regions 212 a-c at the same time, or at separate times using standard lithographic techniques, including a patterned photoresist layer to mask one of the MOS regions.

Next, referring to FIG. 1 and FIG. 2C a patterned tunnel mask 222 is formed on or overlying the MOS regions 212 a-c, and dopants of an appropriate, energy and concentration are implanted through a window or opening in the tunnel mask to form a channel 224 for a NVM transistor 206, and the tunnel mask and the pad oxide 214 in at least the NVM region 208 removed (step 106). The tunnel mask can include a photoresist layer, or a hard mask formed, from a patterned nitride or silicon-nitride layer.

In one embodiment, the channel 224 can be implanted with boron ions (BF₂) at an energy of from about 50 to about 500 kilo-electron volts (keV), and a dose of from about 5e11 m⁻² to about 5e12 cm⁻² to form a p-channel NVM transistor 206. Alternatively, Arsenic or Phosphorous can be implanted through the pad oxide 214 to form a n-channel NVM transistor 206.

The pad oxide 214 over the NVM region 208 is removed through the mask 222, for example in a wet clean process using a 10:1 buffered oxide etch (BOE) containing a surfactant. Alternatively, the wet clean process can be performed using a 20:1 BOE wet etch, a 50:1 hydrofluoric (HF) wet etch, a pad etch, or any other similar hydrofluoric-based wet etching chemistry. The photoresist tunnel mask 222 can be ashed or stripped using oxygen plasma. A hard mask can be removed using a wet or dry etch process.

Referring to FIG. 1 and FIGS. 2D-2E, a number of dielectric or oxide-nitride-oxide (ONO) layers, shown collectively as ONO layers 226, are formed or deposited over the surface 216 of the substrate 204, a mask formed on or overlying the ONO layers, and the ONO layers etched to form a gate stack 228 of a NVM transistor 206 in the NVM region 208 (step 108). Optionally, this step can be preceded by a preclean accomplished using a wet or dry process. In one particular embodiment, the preclean includes a wet process using HF or standard cleans (SC1) and SC2 that are highly selective to the material of the substrate 204. SC1 is typically performed using a 1:1:5 solution of ammonium hydroxide (NH₄OH), hydrogen peroxide (H₂O₂) and water (H₂O) at 50 to 80° C. for about 10 minutes. SC2 is a short immersion in a 1:1:10 solution of HC1, H₂O₂ and H₂O at about 50 to 80° C.

Referring to FIG. 2E, the dielectric or ONO deposition begins with the formation of a tunnel dielectric 230 over at least the channel 224 of a NVM transistor 206 in the NVM region 208 of the substrate 204. The tunnel dielectric 230 may be any material and have any thickness suitable to allow charge carriers to tunnel into an overlying charge-trapping layer under an applied gate bias while maintaining a suitable barrier to leakage when the NVM transistor 206 is unbiased. In certain embodiments, tunnel dielectric 230 is silicon dioxide, silicon oxy-nitride, or a combination thereof and can be grown by a thermal oxidation process, using ISSG or radical oxidation.

In one embodiment a silicon dioxide tunnel dielectric 230 may be thermally grown in a thermal oxidation process. For example, a layer of silicon dioxide may be grown utilizing dry oxidation at 750 degrees centigrade (° C.) −800° C. in an oxygen containing gas or atmosphere, such as oxygen (O₂) gas. The thermal oxidation process is carried out for a duration approximately in the range of 50 to 150 minutes to effect growth of a tunnel dielectric 230 having a thickness of from about 1.0 nanometers (nm) to about 3.0 nm by oxidation and consumption of the exposed surface of substrate.

In another embodiment a silicon dioxide tunnel dielectric 230 may be grown in a radical oxidation process involving flowing hydrogen (H₂) and oxygen (O₂) gas into a processing chamber at a ratio to one another of approximately 1:1 without an ignition event, such as forming of a plasma, which would otherwise typically be used to pyrolyze the H₂ and O₂ to form steam. Instead, the H₂ and O₂ are permitted to react at a temperature approximately in the range of about 900° C. to about 1000° C. at a pressure approximately in the range of about 0.5 to about 5 Torr to form radicals, such as, an OH radical, an HO₂ radical or an O diradical, at the surface of substrate. The radical oxidation process is carried out for a duration approximately in the range of about 1 to about 10 minutes to effect growth of a tunnel dielectric 230 having a thickness of from about 1.0 nanometers (nm) to about 4.0 nm by oxidation and consumption of the exposed surface of substrate. It will be understood that in this and in subsequent figures the thickness of tunnel dielectric 230 is exaggerated relative to the pad oxide 214, which is approximately 7 times thicker, for the purposes of clarity. A tunnel dielectric 230 grown in a radical oxidation process is both denser and is composed of substantially fewer hydrogen atoms/cm³ than a tunnel dielectric formed by wet oxidation techniques, even at a reduced thickness. In certain embodiments, the radical oxidation process is carried out in a batch-processing chamber or furnace capable of processing multiple substrates to provide a high quality tunnel dielectric 230 without impacting the throughput (wafers/hr.) requirements that a fabrication facility may require.

In another embodiment, tunnel dielectric 230 is deposited by chemical vapor deposition (CVD) or atomic layer deposition and is composed of a dielectric layer which may include, but is not limited to silicon dioxide, silicon oxy-nitride, silicon nitride, aluminum oxide, hafnium oxide, zirconium oxide, hafnium silicate, zirconium silicate, hafnium oxy-nitride, hafnium zirconium oxide and lanthanum oxide. In another embodiment, tunnel dielectric 230 is a multilayer tunnel dielectric including at least a bottom layer of a material such as, but not limited to, silicon dioxide or silicon oxy-nitride and a top layer of a material which may include, but is not limited to silicon nitride, aluminum oxide, hafnium oxide, zirconium oxide, hafnium silicate, zirconium silicate, hafnium oxy-nitride, hafnium zirconium oxide and lanthanum oxide.

Referring again to FIG. 2E, a charge-trapping layer 232 is formed on or overlying the tunnel dielectric 230. Generally, as in the embodiment shown, the charge-trapping layer is a multilayer charge-trapping layer comprising multiple layers including at least an oxygen-rich, substantially charge trap free lower or first charge-trapping layer 232 a closer to the tunnel dielectric 230, and an upper or second charge-trapping layer 232 b that is silicon-rich and oxygen-lean relative to the first charge-trapping layer and comprises a majority of a charge traps distributed in multilayer charge-trapping layer.

The first charge-trapping layer 232 a of a multilayer charge-trapping layer 232 can include a silicon nitride (Si₃N₄), silicon-rich silicon nitride or a silicon oxy-nitride (SiO_(x)N_(y) (H_(z))). For example, the first charge-trapping layer 232 a can include a silicon oxynitride layer having a thickness of between about 1.5 nm and about 4.0 nm formed by a CVD process using dichlorosilane (DCS)/ammonia (NH₃) and nitrous oxide (N₂O)/NH₃ gas mixtures in ratios and at flow rates tailored to provide a silicon-rich and oxygen-rich oxynitride layer.

The second charge-trapping layer 232 b of the multilayer charge-trapping layer is then formed over the first charge-trapping layer 232 a. The second charge-trapping layer 232 b can include a silicon nitride and silicon oxy-nitride layer having a stoichiometric composition of oxygen, nitrogen and/or silicon different from that of the first charge-trapping layer 232 a. The second charge-trapping layer 232 b can include a silicon oxynitride layer having a thickness of between about 2.0 nm and about 10.0 nm, and may be formed or deposited by a CVD process using a process gas including DCS/NH₃ and N₂O/NH₃ gas mixtures in ratios and at flow rates tailored to provide a silicon-rich, oxygen-lean top nitride layer.

As used herein, the terms “oxygen-rich” and “silicon-rich” are relative to a stoichiometric silicon nitride, or “nitride,” commonly employed in the art having a composition of (Si₃N₄) and with a refractive index (RI) of approximately 2.0. Thus, “oxygen-rich” silicon oxynitride entails a shift from stoichiometric silicon nitride toward a higher wt. % of silicon and oxygen (i.e. reduction of nitrogen). An oxygen rich silicon oxynitride film is therefore more like silicon dioxide and the RI is reduced toward the 1.45 RI of pure silicon dioxide. Similarly, films described herein as “silicon-rich” entail a shift from stoichiometric silicon nitride toward a higher wt. % of silicon with less oxygen than an “oxygen-rich” film. A silicon-rich silicon oxynitride film is therefore more like silicon and the RI is increased toward the 3.5 RI of pure silicon.

Referring again to FIG. 2E, the number of dielectric layers further includes a blocking dielectric layer or blocking dielectric 234 that is formed on or overlying the charge-trapping layer 232. In one embodiment, the blocking dielectric 234 can include an oxidized portion of the silicon nitride of the underlying second charge-trapping layer 232 b, which is subsequently oxidized by in-situ-steam-generation (ISSG), or radical oxidation to form the blocking dielectric 234. In other embodiments, the blocking dielectric 234 can include a silicon oxide (SiO₂) or a silicon oxynitride (SiON), deposited by CVD, performed in a batch or single substrate processing chamber with or without an ignition event such as plasma. The blocking dielectric 234 can be a single layer of silicon oxide, having a substantially homogeneous composition, a single layer of silicon oxynitride having a gradient in stoichiometric composition, or, as in embodiments described below, can be a multilayer blocking dielectric including at least a lower or first blocking dielectric layer overlying the second charge-trapping layer 232 b, and a second blocking dielectric layer overlying the first blocking dielectric layer.

In one embodiment, the blocking dielectric 234 can include a silicon nitride, a silicon-rich silicon nitride or a silicon-rich silicon oxynitride layer having a thickness of between 2.0 nm and 4.0 nm formed by a CVD process using N₂O/NH₃ and DCS/NH₃ gas mixtures.

Referring to FIGS. 1 and 2F, a gate oxide or GOX preclean is performed, and gate oxides for MOS transistors 210 a-210 c formed in the MOS regions 212 a-c (step 110). Referring to FIG. 2F, the GOX preclean removes the pad oxide 214 from the MOS regions and at least a portion of the blocking dielectric 234 in a highly selective cleaning process. This cleaning process prepares the substrate 204 in the MOS regions 212 a-c for gate oxide growth. In one exemplary implementation the pad oxide 214 is removed in a wet clean process. Alternatively, the wet clean process can be performed using a 20:1 BOE wet etch, a 50:1 hydrofluoric (HF) wet etch, a pad etch, or any other similar hydrofluoric-based wet etching chemistry. In other embodiments, the cleaning process chemistry is chosen so as to remove only a negligible portion of the blocking dielectric 234.

In some embodiments, such as that shown in FIG. 2F, the oxidation process to form gate oxides for MOS transistors 210 a-210 c is a dual gate oxidation process to enable fabrication of both a first, thick, gate oxide 236 over the surface 216 of the substrate 204 in one MOS region 212 b for a HV transistor, such as I/O FET 210 b, and second, thinner gate oxides 238 for LV transistors, such as NLVFET 210 a and PLVFET 210 c, in the remaining MOS regions 212 a and 212 c. Generally, the dual gate oxidation process involves forming the thicker gate oxide 236 over all MOS regions 212 a-212 c, using any known oxidation process in accordance with the methods described herein, forming a patterned photoresist mask using standard lithographic techniques covering MOS region 212 b and NVM region 208, and removing the thick gate oxide in MOS regions 212 a and 212 c by a wet etch process using a 10:1 buffered oxide etch (BOE) containing a surfactant, after which the photoresist mask is stripped or removed, and the second, thinner gate oxides 238 grown or deposited. The thinner gate oxides 238 can be grown, for example, to a thickness from about 1 nm to about 3 nm. It will be understood that, by controlling the thickness of the thick gate oxide 236 as initially formed there is no need to form an additional photoresist mask over the MOS region 212 b since the additional oxide merely adds insubstantially to the thickness of the thick gate oxide. Similarly, the oxidation process to form the thinner gate oxides 238 will have little to no detrimental impact on the blocking dielectric 234.

In another embodiment, the oxidation process to form the thick gate oxide 236 is also used to concurrently form a high-temperature-oxide (HTO) over the gate stack 228 of the NVM transistor 206 to provide a thicker oxide blocking dielectric 234 or an additional HTO layer of a multilayer blocking dielectric. The oxidation process can include in-situ-steam-generation (ISSG), CVD, or radical oxidation performed in a batch or single substrate processing chamber with or without an ignition event such as plasma. For example, in one embodiment the thick gate oxide 236 and the additional or thicker oxide layer of the blocking dielectric 234 may be grown in a radical oxidation process involving flowing hydrogen (H₂) and oxygen (O₂) gas into a processing chamber at a ratio to one another of approximately 1:1 without an ignition event, such as forming of a plasma, which would otherwise typically be used to pyrolyze the H₂ and O₂ to form steam. Instead, the H₂ and O₂ are permitted to react at a temperature approximately in the range of 800-1000° C. at a pressure approximately in the range of 0.5-10 Torr to form radicals, such as, an OH radical, an HO₂ radical or an O diradical radicals at a surface of the blocking dielectric 234. The oxidation process is carried out for a duration approximately in the range of 1-5 minutes for a single substrate using an ISSG process, or 10-15 minutes for a batch furnace process to effect growth of the blocking dielectric 234 having a thickness of from about 2 nm to about 4.5 nm, and a thick gate oxide 236 having a thickness of from about 3 nm to about 7 nm.

Next, referring to FIGS. 1 and 2G, a high dielectric constant or high-k dielectric material 240 is formed or deposited on or over the gate stack of the NVM transistor 206, in the NVM region 208 and in the MOS regions 212 a-c to concurrently form a multilayer blocking dielectric 234 including the high-k dielectric material in the gate stack 228 of the NVM transistor and multilayer gate dielectrics including the gate oxides 236 or 238, and the high-k dielectric material in the MOS regions (step 112). The high-k dielectric material 240 may include, but is not limited to, hafnium oxide, zirconium oxide, hafnium silicate, hafnium oxy-nitride, hafnium zirconium oxide and lanthanum oxide deposited to a physical thickness between about 3.0 nm and about 8.0 nm by, for example, atomic layer deposition (ALD), physical vapor deposition (PVD), a chemical vapor deposition (CVD), a low pressure CVD (LPCVD) or a plasma enhanced CVD (PECVD) process.

Referring to FIGS. 1 and 2H, metal layers of multi-layer gates are formed for the MOS transistors 210 a-210 c and, optionally, over the NVM transistor 206 (step 114). In one embodiment, a first or P+ metal layer 241 (high work function metal) is deposited over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, a patterned photoresist mask (not shown) formed using standard lithographic techniques and the P+ metal layer etched to remove the first or P+ metal layer from MOS regions 210 a and 210 b, stop on surfaces of the high-k dielectric material 240, thereby forming a high work function gate 242 for a P-type low voltage MOS transistor (PLVFET 210 c) and, optionally, a high work function gate 244 for the NVM transistor 206. The P+ metal layer 241 can include aluminum, titanium or compounds or alloys thereof, deposited to a thickness of from about 20 nm to about 100 nm, using physical or chemical vapor deposition. Next, a second or N+ metal layer 245 (low work function) is deposited over substantially the entire surface of the substrate 204, including the gate 242 of the PLVFET 210 c and the gate 244 of the NVM transistor 206, a patterned photoresist mask formed and the N+ metal layer etched to form a non-high or low work function metal gate 246 for a N-type low voltage MOS transistor (NLVET 210 a), and a low work function metal gate 248 for the I/O FET 210 b. Optionally, if a high work function gate for the NVM transistor 206 has not been formed from the first or P+ metal layer 241, a low work function gate 244 may instead be concurrently formed for the NVM transistor 206. The N+ metal layer 245 can include Titanium, Lanthanum, Aluminum, or compounds or alloys thereof, deposited to a thickness of from about 20 nm to about 100 nm, using physical or chemical vapor deposition.

Next, referring to FIGS. 1 and 21, a polysilicon or poly layer is deposited or formed over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, a patterned photoresist mask (not shown) formed using standard lithographic techniques and the polysilicon layer and the underlying metal layers 241 and 245 etched to stop on surfaces of the high-k dielectric material 240, thereby forming metal-polysilicon gates 250 of the MOS transistors 210 a-c and the NVM transistor 206 (step 116). The polysilicon layer can be deposited using chemical vapor deposition (CVD) to a thickness of from about 30 nm to about 100 nm, and etched using standard polysilicon etch chemistries, such as CHF₃ or C₂H₂ or HBr/O₂ which are highly selective to the underlying metal, followed by a metal etch which is highly selective to the material of the high-k dielectric material 240.

Referring to FIGS. 1 and 2J, a first spacer layer is deposited and etched to form first sidewall spacers 252 adjacent to the polysilicon gates 250 and the metal gates 242, 244, 246 and 248, of the MOS transistors 210 a-c and the NVM transistor 206, and one or more lightly-doped drain extensions (MOS LDD 254) are implanted adjacent to and one or more of the MOS transistors (step 118). The first spacer layer can include silicon oxide, deposited to a thickness of from about 10 nm to about 30 nm, using any known CVD technique as described herein. The MOS LDDs 254 are formed by implanting an appropriate ion species at an appropriate energy to an appropriate concentration. For example, drain extensions 254 of the PLVFET 210 a can be formed by forming a photoresist mask through which MOS region 212 c is exposed and implanting boron ions (BF₂) at an energy of from about 10 to about 100 kilo-electron volts (keV), and a dose of from about 1e12 cm⁻² to about 5e14 cm⁻² through the photoresist mask. Optionally, pocket or halo implants (not shown) for the PLVFET 210 c can be done through the same photoresist mask, by implanting Arsenic or Phosphorus at energy of 20 to 70 kilo-electron volts (KeV) and a dose of 2e12 cm⁻² to about 5e12 cm⁻². Similarly, MOS LDDs 254 of the NLVET 210 a and the I/O FET 210 b can be formed by implanting Arsenic or Phosphorus at energy of from about 10 to about 100 kilo-electron volts (keV), and a dose of from about 1e12m⁻² to about 5e14 cm⁻², also through an appropriately patterned photoresist mask. Halo or pocket implants for the NLVFET can also be done through this mask using Boron (BF₂) at energy of 5 to about 50 kilo-electron volts and a dose of 1e12 cm⁻²to 5e12 cm⁻².

Next, referring to FIGS. 1 and 2K a ONO LDD mask is formed over the substrate 204, lightly-doped drain extensions (ONO LDD 256) are implanted, adjacent to the NVM transistor 206, SONOS pocket or halo implants 258 extending partially into the channel region 224 under the gate stack 228 of the NVM transistor implanted., The ONO LDD 256 and the sidewall spacers 252 can be formed using techniques substantially the same as those described above with respect to the MOS LDD 254 and the first sidewall spacers 252. For example, in one embodiment the LDD implants 256 can be formed by an angled implant of, for example, Arsenic or Phosphorous at an energy of from about 5 to about 25 kilo-electron volts (keV), and a dose of from about 5e12 cm⁻² to about 2e14 cm−². Pocket or halo implants 258 can be formed by implanting (BF₂) with energy of 10 to 30 kilo-electron volts and a dose of 1e12 cm⁻² to 3e12 cm⁻². A second spacer layer is deposited and etched to form second sidewall spacers 260 adjacent to the first sidewall spacers 252, of the NVM transistor and MOS transistors (step 120).

Referring to FIGS. 1 and 2L, source and drain implants are performed to form source and drain (S/D) regions 262 for the NVM transistor 206 and all of the MOS transistors 210 a-c, a hard mask (HM) formed and patterned to expose only the S/D regions of the PLVFET 210 c, a silicon-germanium (SiGe) layer deposited and etched, and the hard mask removed to form a strain inducing layer 264 over the S/D regions of the PLVFET (step 122). Additionally, as depicted, a silicide process can be performed to form silicide 266 on the exposed source and drain regions 262. The silicide process may be any commonly employed in the art, typically including a pre-clean etch, nickel metal deposition, anneal and wet strip.

Referring to FIGS. 1 and 2M, the method further includes forming a stress inducing structure or layer 268, such as a stress inducing nitride layer, over the MOS transistors 210 a-c, depositing an interlevel dielectric (ILD) layer 270 over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, and the ILD layer planarized, for example, using a chemical mechanical polishing (CMP) process (step 124). The stress inducing layer 268 can include, a compressive or tensile nitride layer formed using a plasma enhanced chemical vapor deposition (PECVD) or a Bis-TertiaryButylAmino Silane (BTBAS) nitride layer, deposited or grown to a thickness of from about 30 nm to about 70 nm, using any known technique including chemical vapor deposition. The ILD layer 270 can include, for example, silicon oxide, deposited or grown to a thickness of from about 0.5 μm to about 1.0 μm, using any known CVD technique as described above.

Referring FIGS. 1 and 2N, a second ILD layer 274 is deposited over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, and contacts 276 are formed to the S/D regions and gates for the NVM transistor and all of the MOS transistors (step 126). The second ILD layer 274 can include, for example, silicon oxide, deposited or grown to a thickness of from about 0.5 μm to about 1.0 μm, using any known CVD technique as described above. In an alternate embodiment, the second ILD layer 274 can be substantially reduced or omitted entirely, and the contacts 276 formed through just the first ILD layer 272. The contacts 276 can be formed by forming a patterned PR mask over the second ILD layer 274, etching the second ILD layer using any of the standard oxide etch processes as described above to stop on the silicide 266. The contact openings thus formed are then filled with a metal, such as tungsten, using chemical vapor deposition.

Finally, the standard or baseline CMOS process flow is continued to substantially complete the front end device fabrication (step 128), yielding the structure shown in FIG. 2N.

An embodiment of another method for integrating or embedding a NVM transistor into a high-k, metal gate CMOS process flow will now be described in detail with reference to FIG. 3 and FIGS. 4A through 41. FIG. 3 is a flowchart illustrating an embodiment of a gate-last method or process flow. FIGS. 4A-41 are block diagrams illustrating cross-sectional views of a portion of a memory cell 200, including a NVM transistor and MOS transistors, during fabrication of the memory cell according to the method of FIG. 3.

Referring to FIG. 3, as with the gate-first method described above the process begins with forming a number of isolation structures 202 in a wafer or substrate 204 (step 302). At this point the memory cell 200 is substantially identical to that described above and shown in FIG. 2A.

Next, referring to FIG. 3, dopants are then implanted into substrate 204 through the pad oxide 214 to concurrently form wells in the NVM region 208 and one or more of the MOS regions 212 a-c, and to form channels 218 for MOS transistors that will be formed in the MOS regions (step 304). At this point the memory cell 200 is substantially identical to that described above and shown in FIG. 2B.

Referring to FIG. 3 a patterned tunnel mask 222 is formed on or overlying the MOS regions 212 a-c, and dopants of an appropriate, energy and concentration are implanted through a window or opening in the tunnel mask to form a channel 224 for a NVM transistor 206, and the tunnel mask and the pad oxide in at least the NVM region 208 removed (step 306). At this point the memory cell 200 is substantially identical to that described above and shown in FIG. 2C.

Next referring to FIG. 3 a number of dielectric or oxide-nitride-oxide (ONO) layers, shown collectively as ONO layers 226, are formed or deposited the surface 216 of the substrate 204, a mask formed on or overlying the ONO layers, and the ONO layers etched to form a gate stack 228 of a NVM transistor 206 in the NVM region 208 (step 308). At this point the memory cell 200 is substantially identical to that described above and shown in FIGS. 2D-2E.

Referring to FIG. 3, a gate oxide or GOX preclean is performed, and gate oxides for MOS transistors 210 a-210 c formed in the MOS regions 212 a-c (step 310). At this point the memory cell 200 is substantially identical to that described above and shown in FIG. 2F. In some embodiments, such as that shown in FIG. 2F, the oxidation process is a dual gate oxidation process to enable fabrication of both a first, thick gate oxide 236 over the surface 216 of the substrate 204 in one MOS region 212 b for a HV transistor, such as I/O FET 210 b, and second, thinner gate oxides 238 LV transistors 216, such as NLVFET 210 a and PLVFET 210 c, in the remaining MOS regions 212 a and 212 c.

Next referring to FIGS. 3 and 4A, a polysilicon or poly layer is deposited or formed over substantially the entire surface of the substrate 204 including thick gate oxide 236, the thin gate oxides 238 and the blocking dielectric 234, a patterned photoresist mask (not shown) formed using standard lithographic techniques and the polysilicon layer etched to stop on the surface 216 of the substrate 204, thereby forming dummy polysilicon gates 250 over the gate oxides 236 and 238 of the MOS transistors 210 a-c and the ONO layers 226 in the gate stack 228 of the NVM transistor 206 (step 312). The polysilicon layer can be deposited, masked and etched as described above in connection with the gate-first method and FIG. 21.

Referring to FIGS. 3 and 4B, a first spacer layer is deposited and etched to form first sidewall spacers 252 adjacent to the polysilicon gates 250 and the metal gates 242, 244, 246 and 248, of the MOS transistors 210 a-c and the NVM transistor 206, and one or more lightly-doped drain extensions (MOS LDD 254) are implanted adjacent to and one or more of the MOS transistors (step 314). The first sidewall spacers 252 and MOS LDD 254 can be formed as described above in connection with the gate-first method and FIG. 2J.

Next referring to FIGS. 3 and 4C, a ONO LDD mask is formed over the substrate 204, lightly-doped drain extensions (ONO LDD 256) are implanted, adjacent to the NVM transistor 206, SONOS pocket or halo implants 258 extending partially into the channel region 224 under the gate stack 228 of the NVM transistor implanted, and a second spacer layer is deposited and etched to form second sidewall spacers 260 adjacent to the first sidewall spacers 252, of the NVM transistor (step 316). The ONO LDD 256 and the sidewall spacers 252 can be formed as described above in connection with the gate-first method and FIG. 2K.

Referring to FIGS. 3 and 4D, source and drain implants are performed to form S/D regions 262 for the NVM transistor 206 and all of the MOS transistors 210 a-c, a hard mask formed and patterned to expose only the S/D regions of the PLVFET 210 c, a SiGe layer deposited and etched, the hard mask removed to form a strain inducing layer 264 over the S/D regions of the PLVFET, and a silicide process can be performed to form silicide 266 on the exposed S/D regions 262 (step 318). The strain inducing layer 264 and the silicide 266 can be formed as described above in connection with the gate-first method and FIG. 2L.

Next referring to FIGS. 3 and 4E, the method further includes forming a stress inducing structure or layer 268, such as a stress inducing nitride layer, over the MOS transistors 210 a-c, depositing an ILD layer 270 over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, and the ILD layer planarized using a CMP process to expose the dummy polysilicon gates 250 and the dummy polysilicon gates removed (step 320). The stress inducing structure or layer 268 can be formed as described above in connection with the gate-first method and FIGS. 2M and 2N. The dummy polysilicon gates 250 can be etched or removed using standard polysilicon etch chemistries, such as described above, which are highly selective to the material of the ILD layer 270, the first and second spacers 252, 260, the ONO layers 226 and the gate oxides 236 and 238.

Referring to FIGS. 3 and 4F, a high-k dielectric material 240 is formed or deposited on or over the ONO layers 226 and the gate oxides 236 and 238 exposed by the removal of the dummy polysilicon gates 250 to concurrently form a multilayer blocking dielectric 234 including the high-k dielectric material in the gate stack 228 of the NVM transistor and multilayer gate dielectrics including the gate oxides 236, 238, and the high-k dielectric material in the MOS regions (step 322). The high-k dielectric material 240 may include, but is not limited to, hafnium oxide, zirconium oxide, hafnium silicate, hafnium oxy-nitride, hafnium zirconium oxide and lanthanum oxide deposited to a physical thickness between about 3.0 nm and about 8.0 nm by, for example, physical vapor deposition (PVD), atomic layer deposition (ALD), a chemical vapor deposition (CVD), a low pressure CVD (LPCVD) or a plasma enhanced CVD (PECVD) process.

Referring to FIGS. 3 and 4G, first metal layers of multilayer metal gates are formed for the MOS transistors 210 a-210 c, and, optionally, for the NVM transistor 206 (step 324). In one embodiment, a first or P+ metal layer (high work function) is deposited over substantially the entire surface of the substrate 204 and all layers and structures formed thereon, a patterned photoresist mask (not shown) formed using standard lithographic techniques and the P+ metal layer etched to stop on surfaces of the high-k dielectric material 240, thereby forming a high work function gate 242 for a P-type low voltage MOS transistor (PLVFET 210 c) and a optionally a high work function gate 244 for the NVM transistor 206. Next, a second or N+ metal layer (low work function) is deposited over substantially the entire surface of the substrate 204, including the gate 242 of the PLVFET 210 c, a patterned photoresist mask formed and the N+ metal layer etched to form a low work function metal gate 246 for a N-type low voltage MOS transistor (NLVFET 210 a), a metal gate 248 for the I/O FET 210 b. Optionally, if a high work function gate for the NVM transistor 206 has not been formed from the first or P+ metal layer, a low work function gate 244 may instead be concurrently formed for the NVM transistor 206.

Finally referring to FIG. 3 and FIGS. 4H and 41, a thick gate metal layer 272 is deposited followed by planarization using a CMP process to form a second metal layer completing the formation of multilayer metal gates for the NVM transistor 206 and all of the MOS transistors 210 a-c, a second ILD layer 274 deposited and contacts 276 formed to the S/D regions and gates for the NVM transistor and all of the MOS transistors (step 326). The thick metal layer 272 can include a conformal layer of Aluminum, titanium, titanium-nitride, tungsten or compounds or alloys thereof, deposited to a thickness of from about 0.1 μm to about 0.5 μm, using physical or chemical vapor deposition. The second ILD layer 274 can include, for example, silicon oxide, deposited or grown to a thickness of from about 0.5 μm to about 1 μm, using any known CVD as described above. The contacts 276 can be formed by forming a patterned PR mask over the second ILD layer 274, etching the second ILD layer using any of the standard oxide etch processes as described above to stop on the silicide 266. The contacts 276 can be formed as described above in connection with the gate first method and FIG. 2N.

Another embodiment of a method for integrating or embedding a NVM transistor into a high-k, metal gate CMOS process flow will now be described in detail with reference to FIG. 5 and FIGS. 6A through 6F. FIG. 5 is a flowchart illustrating an embodiment of a method or process flow in which the high-k dielectric material 240 is incorporated into the charge trapping. FIGS. 6A-6F are block diagrams illustrating cross-sectional views of a portion of a memory cell 200, including a NVM transistor and MOS transistors, during fabrication of the memory cell according to the method of FIG. 5.

Referring to FIG. 5, as with the gate-first method described above the process begins with forming a number of isolation structures 202 in a wafer or substrate 204 (step 502). Next, dopants are then implanted into substrate 204 through the pad oxide 214 to concurrently form wells in the NVM region 208 and one or more of the MOS regions 212 a-c, and to form channels 218 for MOS transistors that will be formed in the MOS regions (step 504). A patterned tunnel mask 222 is formed on or overlying the MOS regions 212 a-c, and dopants of an appropriate, energy and concentration are implanted through a window or opening in the tunnel mask to form a channel 224 for a NVM transistor 206, and the tunnel mask and the pad oxide in at least the NVM region 208 removed (step 506). At this point the memory cell 200 is substantially identical to that described above and shown in FIG. 2C.

Next referring to FIG. 5 and FIGS. 6A and 6B, a number of dielectric or oxide and oxynitride or nitride layers, shown collectively as ON layers 278, are formed or deposited the surface 216 of the substrate 204, a mask formed on or overlying the ON layers, and the ON layers etched to form a gate stack 280 of the NVM transistor 206 in the NVM region 208 (step 508). It will be understood that up to this point the memory cell 200 shown in FIGS. 6A and 6B differ from that of the embodiments of FIGS. 2D and 2E in that the gate stack 280 of the NVM transistor 206 does not include the blocking dielectric 234. It will be further understood that as with embodiments described above, the tunnel dielectric 230 and the charge-trapping layer 232 may include one or more layers of material. In particular, the charge-trapping layer 232 may be or include a multilayer charge-trapping layer including at least an oxygen-rich, substantially charge trap free lower or first charge-trapping layer 232 a closer to the tunnel dielectric 230, and an upper or second charge-trapping layer 232 b that is silicon-rich and oxygen-lean relative to the first charge-trapping layer and comprises a majority of a charge traps distributed in multilayer charge-trapping layer.

Next referring to FIGS. 5 and 6C, a gate oxide or GOX preclean is performed, and gate oxides for MOS transistors 210 a-210 c formed in the MOS regions 212 a-c (step 510). Referring to FIG. 2F, in some embodiments, such as that shown, the oxidation process is a dual gate oxidation process to enable fabrication of both a first, thick gate oxide 236 over the surface 216 of the substrate 204 in one MOS region 212 b for a HV transistor, such as I/O FET 210 b, and second thinner gate oxides 238 LV transistors 216, such as NLVFET 210 a and PLVFET 210 c, in the remaining MOS regions 212 a and 212 c. The thick gate oxide 236 and thin gate oxides 238 can be formed as described above in connection with the gate-first method and FIG. 2F.

Next referring to FIG. 5 and FIGS. 6D and 6E, a high dielectric constant or high-k dielectric material 240 is formed or deposited on or over the gate stack 280 of the NVM transistor 206, in the NVM region 208 and in the MOS regions 212 a-c to concurrently form a multilayer charge-trapping layer 232 including the high-k dielectric material and multilayer gate dielectrics including the gate oxides 236, 238, and the high-k dielectric material in the MOS regions (step 512). The high-k dielectric material 240 can include any of the high-k materials described above in connection with the gate-first method and FIG. 2G, and can be deposited by CVD or ALD.

In one embodiment, such as that shown in FIG. 6E, the multilayer charge-trapping layer 232 can include an oxygen-rich, substantially charge trap free lower or first charge-trapping layer 232 a closer to the tunnel dielectric 230, a trap rich, silicon-rich and oxygen-lean upper or second charge-trapping layer 232 b, and the high-k dielectric material 240. It will be appreciated that in some versions of this embodiment the high K layer can also act as an additional charge trapping layer.

Referring to FIGS. 5 and 6F, a blocking dielectric 234 is formed on or overlying the high-k dielectric material 240 and patterned (step 514). In one embodiment, the blocking dielectric 234 can include a silicon oxide (SiO₂) or a silicon oxynitride (SiON), formed by CVD performed in a batch or single substrate processing chamber with or without an ignition event such as plasma. The blocking dielectric 234 can be a single layer of silicon oxide, having a substantially homogeneous composition, or a single layer of silicon oxynitride having a gradient in stoichiometric composition. Using a photoresist mask and etch, the layer 234 can be removed from the MOS regions 212 a-c.

Finally, the process can be continued with either the gate-first process flow illustrated and described above with respect to FIG. 1, or the either the gate-last process flow illustrated and described above with respect to FIG. 3. That is the gate-first process flow can be followed beginning with the forming of metal gates of the MOS transistors 210 a-c, and optionally for the NVM transistor 206 in step 114 and continuing through step 128. Similarly in an alternative embodiment is the gate-last process flow can be followed beginning with deposition of a polysilicon layer and forming of dummy polysilicon gates 250 for the MOS transistors 210 a-c, and optionally for the NVM transistor 206 in step 312 and continuing through step 326.

Another embodiment of a method for integrating or embedding a NVM transistor into a high-k, metal gate CMOS process flow will now be described in detail with reference to FIG. 7 and FIGS. 8A through 8E. FIG. 7 is a flowchart illustrating an embodiment of a method or process flow in which the high-k dielectric material 240 is incorporated into the tunnel dielectric 230. FIGS. 8A-8E are block diagrams illustrating cross-sectional views of a portion of a memory cell 200, including a NVM transistor and MOS transistors, during fabrication of the memory cell according to the method of FIG. 7.

Referring to FIG. 7, as with the methods or process flows described above the process begins with forming a number of isolation structures 202 in a wafer or substrate 204 (step 702), and implanting dopants into substrate 204 through the pad oxide 214 to concurrently form wells in the NVM region 208 and one or more of the MOS regions 212 a-c, and to form channels 218 for MOS transistors that will be formed in the MOS regions (step 704). At this point the memory cell 200 is substantially identical to that shown in FIG. 2B and described above.

Next referring to FIG. 7 and FIG. 8A, a gate oxide or GOX preclean is performed, and gate oxides for MOS transistors 210 a-210 c formed in the MOS regions 212 a-c (step 706). In some embodiments, such as that shown in FIG. 8A, the oxidation process is a dual gate oxidation process to enable fabrication of both a first, thick gate oxide 236 over the surface 216 of the substrate 204 in one MOS region 212 b for a HV transistor, such as I/O FET 210 b, and second thinner gate oxides 238 LV transistors 216, such as NLVFET 210 a and PLVFET 210 c, in the remaining MOS regions 212 a and 212 c. The thick gate oxide 236 and thin gate oxides 238 can be formed as described above in connection with the gate-first method and FIG. 2F.

Referring to FIG. 7 and FIG. 8B, using a photoresist mask and a BOE etch any gate oxide formed in the NVM region 208 is removed to expose the surface 216 in this region, and a high dielectric constant or high-k dielectric material 240 is formed or deposited on or over the NVM region 208 and in the MOS regions 212 a-c to concurrently form a high-k tunnel dielectric 282 and multilayer gate dielectrics in the MOS regions including the gate oxides 236, 238, and the high-k dielectric material 240 (step 708). The high-k dielectric material 240 can include any of the high-k materials described above in connection with the gate-first method and FIG. 2G, and can be deposited by CVD or ALD. It is noted that the embodiment of FIG. 8B differs from those shown and described above in that the high-k tunnel dielectric 282 is formed directly on the surface 216 of the substrate 204 in the NVM region 208, prior to or in place of forming a tunnel dielectric 230 including a silicon oxide, or silicon-oxynitride. It will be understood however that in an alternative embodiment (not shown) the high-k tunnel dielectric 282 can be part of a multilayer tunnel dielectric, formed over a silicon oxide, or silicon-oxynitride formed or grown in the NVM region 208 during or following gate oxidation process.

Next referring to FIG. 7 and FIG. 8C, a patterned tunnel mask 222 is formed on or overlying the MOS regions 212 a-c, and dopants of an appropriate, energy and concentration are implanted through a window or opening in the tunnel mask to form a channel 224 for a NVM transistor 206 (step 710).

Referring to FIG. 7 and FIGS. 8D and 8E, the tunnel mask removed and a number of dielectric or nitride-oxide (NO) layers, shown collectively as NO layers 284, are formed or deposited the surface 216 of the substrate 204, a mask formed on or overlying the ONO layers, and the ONO layers etched to form a gate stack 286 of a NVM transistor 206 in the NVM region 208 (step 712). As with embodiments described above, the charge-trapping layer 232 and the blocking dielectric 234 may include one or more layers of material. In particular, the charge-trapping layer 232 may be or include a multilayer charge-trapping layer including at least an oxygen-rich, substantially charge trap free lower or first charge-trapping layer 232 a closer to the high-k tunnel dielectric 282, and an upper or second charge-trapping layer 232 b that is silicon-rich and oxygen-lean relative to the first charge-trapping layer and comprises a majority of the charge traps distributed in multilayer charge-trapping layer.

Finally, the process can be continued with either the gate-first process flow illustrated and described above with respect to FIG. 1, or the either the gate-last process flow illustrated and described above with respect to FIG. 3. That is the gate-first process flow can be followed beginning with the forming of metal gates of the MOS transistors 210 a-c, and optionally for the NVM transistor 206, in step 114 and continuing through step 128. Similarly in the alternative embodiment the gate-last process flow can be followed beginning with deposition of a polysilicon layer and forming of dummy polysilicon gates 250 for the MOS transistors 210 a-c, and optionally for the NVM transistor 206 in step 312 and continuing through step 326.

Thus, embodiments of methods for fabricating memory cells including embedded or integrally formed ONO based NVM transistor and MOS transistors with high-k gate dielectrics and/or high work function metal gates have been described. Although the present disclosure has been described with reference to specific exemplary embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the disclosure. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.

The Abstract of the Disclosure is provided to comply with 37 C.F.R. §1.72(b), requiring an abstract that will allow the reader to quickly ascertain the nature of one or more embodiments of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.

Reference in the description to one embodiment or an embodiment means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the circuit or method. The appearances of the phrase one embodiment in various places in the specification do not necessarily all refer to the same embodiment. 

1-20. (canceled)
 21. A memory device comprising: a substrate, comprising a first region and a second region; a non-volatile memory (NVM) transistor, formed in the first region of the substrate, wherein the NVM transistor comprises a first metal gate, comprising a first metal; and a plurality of metal-oxide-semiconductor (MOS) transistors, formed in the second region of the substrate, wherein the plurality of MOS transistors comprises: a first type of low voltage field effect transistor (LVFET), comprising a second metal gate formed over a first gate dielectric, the second metal gate comprising a second metal, different from the first metal, and an input/output field effect transistor (I/O FET), comprising a third metal gate formed over a second gate dielectric, the third metal gate comprising the second metal.
 22. The memory device of claim 21, wherein the first gate dielectric comprises a high-k dielectric layer.
 23. The memory device of claim 21, wherein the second gate dielectric comprises a high-k dielectric layer.
 24. The memory device of claim 21, wherein the NVM transistor comprises a tunnel dielectric, a charge-trapping layer, and a blocking dielectric, wherein the tunnel dielectric is disposed over the substrate, and wherein the blocking dielectric is disposed over the tunnel dielectric.
 25. The memory device of claim 24, wherein the blocking dielectric comprises a high-k dielectric layer.
 26. The memory device of claim 25, wherein the blocking dielectric further comprises a first high-temperature-oxide (HTO), disposed between the high-k dielectric layer of the blocking layer and the charge-trapping layer.
 27. The memory device of claim 24, wherein the tunnel dielectric is a multilayer tunnel dielectric, comprising a bottom layer and a top layer, wherein the bottom layer comprises a high-k dielectric material, and wherein the top layer is disposed over the bottom layer and comprises silicon dioxide or silicon oxy-nitride.
 28. The memory device of claim 24, wherein the charge-trapping layer comprises a first charge-trapping layer and a second charge-trapping layer, wherein the first charge-trapping layer is oxygen-rich, wherein the second charge-trapping layer is disposed over the first charge-trapping layer, wherein the second charge-trapping layer is silicon-rich and oxygen-lean relative to the first charge-trapping layer, and wherein the second charge-trapping layer comprises a majority of charge traps in the charge-trapping layer.
 29. The memory device of claim 24, wherein the first metal gate of the NVM transistor is formed over the blocking dielectric.
 30. The memory device of claim 21, wherein the plurality of MOS transistors further comprises a second type of LVFET, the second type of LVFET comprising a fourth metal gate, the fourth metal gate comprising a third metal.
 31. The memory device of claim 30, wherein the first metal is same as the third metal.
 32. The memory device of claim 31, wherein the second type of LVFET is a p-type LVFET.
 33. The memory device of claim 30, further comprising: a polysilicon gate layer disposed over the first metal gate of the NVM transistor, the second metal gate of the first type of LVFET, the fourth metal gate of the second type of LVFET, and the third gate of the I/O FET.
 34. A memory device comprising: a substrate, comprising a non-volatile memory (NVM) region and a plurality of metal-oxide-semiconductor (MOS) regions; at least one NVM transistor, disposed in the NVM region, each of the at least one NVM transistor comprising a first metal gate, a tunnel dielectric, a multilayer charge-trapping layer, and a blocking dielectric, the first metal gate comprising a first metal, the blocking dielectric disposed over the multilayer charge-trapping layer; and at least one two MOS transistors, disposed in at least one of the plurality of MOS regions, the at least two MOS transistors comprising a first type of low voltage field effect transistor (LVFET), and an input/output field effect transistor (I/O FET), the first type of LVFET comprising a second metal gate, the second metal gate comprising a second metal, different from the first metal, the second metal gate formed over a first gate dielectric; and the input/output field effect transistor (I/O FET) comprising a third metal gate, the third metal gate comprising the second metal, the third metal gate formed over a second gate dielectric.
 35. The memory device of claim 34, wherein each of the tunnel dielectric and the blocking dielectric of the at least one NVM transistor comprises a multilayer structure, and wherein at least one of the tunnel dielectric or the blocking dielectric comprises a high-k dielectric material.
 36. The memory device of claim 34, wherein the first metal gate of the at least one NVM transistor comprises a high work function metal, and wherein the second metal gate of the first type of LVFET comprises a low work function metal.
 37. The memory device of claim 34, wherein the multilayer charge-trapping layer comprises a first charge-trapping layer and a second charge-trapping layer, disposed over the first charge-trapping layer, and wherein the second charge-trapping layer is silicon-rich and oxygen-lean relative to the first charge-trapping layer.
 38. A memory cell comprising: a substrate, divided into at least a non-volatile memory (NVM) region and a metaloxide-semiconductor (MOS) region, wherein the NVM region comprises at least one NVM transistor, comprising a first metal gate, a tunnel dielectric, a charge-trapping layer, and a blocking dielectric, the first metal gate comprising a first metal, wherein the MOS region comprises at least one p-type low voltage field effect transistor (LVFET), at least one n-type LVFET, and at least one input/output field effect transistor (I/O FET), wherein one of the at least one p-type LVFET or the at least one n-type LVFET comprises a second metal gate, the second metal gate comprising a second metal, different from the first metal, and wherein the at least one I/O FET comprises a third metal gate, the third metal gate comprising the second metal.
 39. The memory cell of claim 38, wherein each of the at least one p-type LVFET, the at least one n-type LVFET, and the at least one I/O FET comprises a gate dielectric, disposed over the substrate, and wherein the gate dielectric of the at least one I/O FET is thicker than the gate dielectric of the at least one p-type LVFET and the gate dielectric of the at least one n-type LVFET.
 40. The memory cell of claim 38, wherein the first metal of the least one NVM transistor is a high work function metal, and wherein the second metal of the at least one p-type LVFET or the at least one n-type LVFET is a low work function metal. 